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Abstract 

Spectral embedding of graphs uses the top k eigenvectors of the random walk matrix to 
embed the graph into M fe . The primary use of this embedding has been for practical spec- 
tral clustering algorithms [SM00, NJW01]. Recently, spectral embedding was studied from a 
theoretical perspective to prove higher order variants of Cheeger's inequality [LOT12, LRTV12]. 

We use spectral embedding to provide a unifying framework for bounding all the eigenvalues 
of graphs. For example, we show that for any finite graph with n vertices and all k > 2, the 
fcth largest eigenvalue is at most 1 — £l(fc 3 /n 3 ), which extends the only other such result known, 
which is for k = 2 only and is due to [L081]. This upper bound improves to 1 — il(k 2 /n 2 ) if 
the graph is regular. We generalize these results, and we provide sharp bounds on the spectral 
measure of various classes of graphs, including vertex-transitive graphs and infinite graphs, in 
terms of specific graph parameters like the volume growth. 

As a consequence, using the entire spectrum, we provide (improved) upper bounds on the 
return probabilities and mixing time of random walks with considerably shorter and more direct 
proofs. Our work introduces spectral embedding as a new tool in analyzing reversible Markov 
chains. Furthermore, building on [Lyo05], we design a local algorithm to approximate the 
number of spanning trees of massive graphs. 
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1 Introduction 

A very popular technique for clustering data involves forming a (weighted) graph whose vertices 
are the data points and where the weights of the edges represent the "similarity" of the data points. 
Several of the eigenvectors of one of the Laplacian matrices of this graph are then used to embed 
the graph into a moderate-dimensional Euclidean space. Finally, one partitions the vertices using 
/c-means or other heuristics. This is known as spectral embedding or spectral clustering, and it 
is applied in various practical domains (see, e.g., [SMOO, NJW01, Lux07]). Recently, theoretical 
justifications of some of these algorithms have been given. For example, [LOT12, LRTV12, DJM12] 
used spectral embedding to prove higher order variants of Cheeger's inequality, namely, that a graph 
can be partitioned into k subsets each defining a sparse cut if and only if the kth smallest eigenvalue 
of the normalized Laplacian is close to zero. 

Spectral embedding for finite graphs is easy to describe. For simplicity in this paragraph, let 
G = (V, E) be a d-regular, connected graph, and let A be the adjacency matrix of G. Then the 
normalized Laplacian of G is C := I — A/d. Let gi, ■ ■ ■ ,gk be orthonormal eigenfunctions of C 
corresponding to the k smallest eigenvalues Ai, . . . , Then (up to normalization) the spectral 
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embedding is the function F : V — > R fc 1 denned by 

F(v) := (g2(v),g 3 (v), . . .,g k {v)) . 

This embedding satisfies interesting properties, including one termed "isotropy" (see Section 3 for 
a detailed discussion). This isotropy property says that for any unit vector v £ 

^d(v,F(x)) 2 = l. 

x£V 

The embedding is naturally related to the eigenvalues of C For example, it is straightforward that 

(k-l)X k >^2\\F(x)-F(y)\\ 2 . (1.1) 

Let the energy of F be the value of the right-hand side of the above inequality. It follows from 
the variational principle that the spectral embedding is an embedding that minimizes the energy 
among all isotropic embeddings. (Note that the embedding that only minimizes the energy is the 
one that maps every vertex to the same point in ]R fc_1 .) 

In this paper, we use spectral embedding as a unifying framework to bound from below all the 
eigenvalues of the normalized Laplacian of (weighted) graphs. We prove universal lower bounds 
on these eigenvalues, equivalently, universal upper bounds on the eigenvalues of the random walk 
matrix of G. The usual methods for obtaining such bounds involve indirect methods from functional 
analysis. By contrast, our method is direct, which leads to very short proofs, as well as to improved 
bounds. By (1.1), all we need to do is to bound from below the energy of an isotropic embedding. 
We use simple properties of Hilbert spaces, as well as underlying properties of G, to achieve this 
goal. 

There have been a great many papers that upper-bound the return probability or the mixing 
time of random walks. It is known that return probabilities are closely related to the vertex 
spectral measure (see Lemmas 4.6 and 4.7 for a detailed proof). Therefore, once we can control 
the eigenvalues, we can reproduce, or even improve, bounds on return probabilities. Our work thus 
introduces spectral embedding as a new tool in analyzing reversible Markov chains. 

1.1 Results 

In order to give an overview of our results, we need the following notation, which is explained in more 
detail in Sections 2 and 4. To simplify, we consider only unweighted graphs in this introduction. 
Let A be the adjacency matrix of a connected graph G = (V, E) and D be the diagonal degree 
matrix. The normalized Laplacian matrix is C := I— D~ l l 2 AD~ l l 2 . If G is finite of size n, then the 
eigenvalues of C are = Ai < A2 < • • • < A n < 2. If G is infinite, there may not be any eigenvectors 
in £ 2 (V), so one defines instead a spectral probability measure \i x on [0,2] corresponding to each 
vertex x G V . One way to define \i x is via random walks. Consider lazy simple random walk, which 
stays put with probability 1/2 and moves to a random uniform neighbor otherwise. Write pt(x,x) 
for the probability that random walk started at x is back at x on the tth step. Then 

Pt (x,x) = [\l - X/2) 1 dfi x (X) . 
Jo 
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In the finite case, we define fi := ^2 xe y fJ-x/n, where n := \ V\. In this case, fi(6) = max.{k/n : < 
5}. Write tt(x) for the degree of x divided by |V|, which is when G is infinite. It will be more 
convenient to use fi* := \i x — 7r(x)lo and fj,* := fi — lo/n, where lo denotes the point mass at 0. 

Our main contributions are the following results, all of which we believe to be new, as well as 
the technique used to establish them. The sharpness of these results (up to a constant factor) is 
discussed briefly here and in more detail in the body of the paper. 

Theorem 1.1. For every finite, unweighted, connected graph G, and every 5 6 (0,2), we have 
H*(6) < 14.85 1 / 3 and 

(k — l) 3 



A fc > 



3200ra 3 



Thus, for every integer t > 1, we have 



52 xeV Pt(x,x) - 1 17_ 
n tV3 • 

Here, the first result is sharp for each k separately and the second result is sharp. 

Our main implication of the above result is a fast local algorithm for approximating the number 
t(G) of spanning trees of a finite massive graph, G. The problem of counting the number of 
spanning trees of a graph is one of the fundamental problems in graph theory, for which the 
Matrix- Tree Theorem gives a simple 0(n 3 )-time algorithm. For very large n, however, even this 
is too slow. For a general graph, t(G) can be as large as n n ~ 2 , which is its value for a complete 
graph by Cayley's theorem [Cay89]. 

A local graph algorithm is one that is allowed to look only at the local neighborhood of random 
samples of vertices of the graph. The notion of graph-parameter estimability involves estimating a 
graph parameter, such as r(G), using a local graph algorithm (see, e.g., [ElelO] or [Lovl2, Chap. 22] 
for a discussion). We prove that t(G) is estimable in this sense. In fact, we prove estimability in 
an even stronger sense. Suppose that we have access to G only through an oracle that supports 
the following simple operations: 

• Select a uniformly random vertex of G. 

• For a given vertex x £ V, select a uniformly random neighbor of x. 

• For a given vertex x £ V, return w(x). 

The proof of the next corollary presents a local algorithm for approximating the number of spanning 
trees of G that uses an oracle satisfying the above operations, as well as knowledge of n and \E\. 
For any given e > 0, our algorithm approximates ^logr(G) within an e-additive error using only 
0(poly(e~ 1 logra)) queries. 

Corollary 1.2. Let G be a finite, unweighted, connected graph. Given an oracle access to G 
that satisfies the above operations, together with knowledge of \V\ and \E\, there is a randomized 
algorithm that for any given e, 5 > 0, approximates logr(G)/|V| within an additive error of e, with 
probability at least 1 — 5, by using only 0(e~ 5 + e _2 log 2 |y|)log<5 -1 many oracle queries. 

The preceding Theorem 1.1 gave an Q ((k— l) 3 /n 3 ) bound for With the additional hypothesis 
of regularity, this can be improved to Onfc — l) 2 /n 2 ). In fact, only a bound for the ratio of the 
maximum degree to the minimum degree is needed. 
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Theorem 1.3. For every unweighted, connected, regular graph G and every x G V, we have 
H* x {5) < 10y/6. Hence, p*(5) < 10y/S. If G is finite, for 1 < k < n, we get 

x , (fc ~ 1)2 

k 100n 2 • 

For all t > and x G V , we have 

Pt(x,x) - 7r(x) < -j= . 

This result is evidently sharp as shown by the example of a cycle, which also shows sharpness 
of the next result. 

For a finite G, let Too(l/4) denote the uniform mixing time, i.e., the time t until \pt(x, y)/ir{y) — 
1| 5; 1/4 for every x,y G V. 

Proposition 1.4. For every unweighted, finite, connected regular graph G, we have 

Too (1/4) < 24n 2 . 

The next theorem answers (up to constant factors) the 5th open question in [MT06], which 
asks how small the log-Sobolev and entropy constants can be for an n-vertex unweighted connected 
graph. 

Theorem 1.5. Write p(G) for the log-Sobolev constant and po(G) for the entropy constant of G. 
For finite unweighted graphs G with n vertices, we have 

min p(G) = 9(n~ 3 ) and mm p (G) = 6(n" 3 ) . 

G G 
Similarly, we find the worst uniform mixing time of graphs: 
Proposition 1.6. For every unweighted, finite, connected graph G, we have 

Too(l/4) <8n 3 . 

This result is sharp. 

The preceding three results have been known implicitly in the sense that they could have been 
easily deduced from known results, but for some reason, they were not. 

Finally, the case of transitive graphs is especially interesting and especially well studied, yet, to 
the best of our knowledge, the following theorem has not been proved in this generality. 

Theorem 1.7. For every connected, unweighted, vertex-transitive, locally finite graph G of degree 
d, every a G (0, 1), 5 G (0, 2), and every x G V, 

where N(r) denotes the number of vertices in a ball of radius r. 

Our technique yields very short proofs of the above results. In addition, one can immediately 
deduce such results as that return probabilities in infinite transitive graphs with polynomial growth 
at least order a decay at polynomial rate at least order a/2. This is, of course, the correct decay 
rate on Z a for a G N. 
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1.2 Related Works 



There have been many studies bounding from above the eigenvalues of the (normalized) Laplacian 
(equivalently, bounding the eigenvalues of the (normalized) adjacency matrix from below). For 
example, Kelner et al. [KLPT11] show that for n-vertex, bounded-degree planar graphs, one has 
that the kth smallest eigenvalue satisfies Afc = 0(k/n). 

However, to the best of our knowledge, universal lower bounds were known only for the second 
smallest eigenvalue of the normalized Laplacian. Namely, Landau and Odlyzko [L081] showed that 
the second eigenvalue of every simple connected graph of size n is at least 1/n 3 . 

On the other hand, there have also been a great many papers that bound from above the return 
probabilities of random walks, both on finite and infinite graphs. Such bounds correspond to lower 
bounds on eigenvalues. In fact, as we review in Subsection 4.2, the asymptotics of large-time return 
probabilities correspond to the asymptotics of the spectral measure near 0, which, for finite graphs, 
means the behavior of the smallest eigenvalues. 

Our methods would work as well for the eigenvalues Afc of the unnormalized combinatorial 
Laplacian L. In this case, [Fri96] has determined the minimum of Afc for each k over all unweighted 
n-vertex graphs. As noted there, his bound implies that Afc = Q(k 2 /n 2 ); this immediately implies 
that Afc = $7(/c 2 /n 3 ) by comparison of Rayleigh quotients, but this is not sharp, as indicated by 
Theorem 1.1. 

1.3 Structure of the Paper 

After a short section of basic notations, we give an overview of our proof techniques by giving a 
proof of Theorem 1.3 for finite graphs. Here the regularity of the graph simplifies notation, but also 
provides powerful geometric constraints on the eigenvalues. We then resume with the remainder 
of the required background in Section 4. After this background has been reviewed, Section 5 gives 
some very simple proofs of known results, followed by proofs of new results that lead to the above 
bounds on mixing time, log-Sobolev constants, and entropy constants. Our most sophisticated 
proof is in Section 6, which establishes Theorem 1.1 and its corollaries. The case of transitive 
graphs is treated in Section 7, while the appendix collects some proofs of known results for the 
convenience of the reader. 

2 Graph Notation and the Laplacian 

Let G = (V, E) be a finite or infinite, weighted, undirected, connected graph. Since we allow 
weights, we do not allow multiple edges. Also, we do not allow loops. Thus, G is always a simple 
graph. If G is finite, we use n := \V\ to denote the number of vertices. For each edge (x,y) 6 E, 
let w(x,y) > be the weight of (x,y). In almost all instances, throughout the paper we assume 
that w(x,y) > 1 for every edge (x,y) £ E. However, we make this assumption explicit each time. 
We say G is unweighted if w(x,y) = 1 for every edge (x,y) E E. 

For each vertex x G V, let w(x) := J2 y ~ x w ( x > u) De the (weighted) degree of x in G. Since 
G is connected, w(x) > for all x £ V. For a set S C V, let wt(5) := YlxtS w(x). Similarly, 
let wt(E') := YleeE' w ( e ) f° r E' C E. Also let nbd(x) be the set of neighbors of x in G. We 
define the function tt: V — > R by letting tt(x) := for all vertices in infinite graphs, while ir(x) := 
w(x) / ^2 y&v w(y) for finite graphs. 
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For a vertex x G V, we use l x to denote the indicator vector of x, 

i x (y) ■-- 



1 if y = x, 
otherwise. 



We also use e x := l x / w(x). For two vertices x,y G V, we use dist(x,y) to denote the length 
of a shortest path from x to y. For every r > 0, we write B^- lst (x,r) := {y : dist(x,y) < r} to 
denote the set of vertices at distance at most r from x. Define diam(x) := sup y dist(x, y) and 
diam := max x . diam(x). For a vertex x G V and radius r > 0, let 

wt(x,r) := wt (B dist (x,r)) := w(y) . 

y:dist(x,y)<r 

Recall the gamma function, r(z) := J °° e~ l t z ~ l dt. 

We write £ 2 (V,w) for the (real or complex) Hilbert space of functions /: V — > R or C with 
inner product 

(/,5>to : = ^ (^/(^(aj) 

and norm ||/||^ := {f,f) w - We reserve (•,•) and || • || for the standard inner product and norm on 
£ 2 {V). 

We now discuss some operators on £ 2 {V). The adjacency operator is defined by Af(x) := 
J2 y ~x w ( x iy)f(y)' an d the diagonal degree operator by Df(x) := w{x)f(x). Then the combina- 
torial Laplacian is defined by L := D — A, and the normalized Laplacian is given by 

C ■= D~ 1/2 LD~ 1/2 = I - D~ 1/2 AD~ 1/2 . 

Observe that for a d-regular graph, we have C = ^L. 

Now, if g: V — > R is a non-zero function and / = -D _1 / 2 <7, then 

(g,Cg) _ (f,Lf) _ J2^ v Hx,y)\f(x)-f(y)\ 2 _ 
(9,9) </,/>« E v evMx)f(x) 2 • ayG[J> 

is called the Rayleigh quotient of f (with respect to G). Note that the sum over x ~ y is over 
unordered pairs, i.e., over all undirected edges. 

In particular, when G is finite, one sees that £ is a positive semi-definite operator with eigen- 
values 

= Ai < A 2 < • • • < A n < 2 . 

Since G is connected, the first eigenvalue corresponds to the eigenfunctions g = D 1 / 2 /, where / is 
any non-zero constant function. Furthermore, by standard variational principles, 

A fc = min max ( iffl— ^ ■ g e span-j^i, ...,g k } 

= t m™,^ max{Ray G (/) : / G span{/i,...,/ fc }| , (2.1) 

where both minima are over sets of k non-zero linearly independent functions in the Hilbert spaces 
£ 2 (V) and £ 2 (V,w), respectively. We refer to [Chu97] for more background on the spectral theory 
of the normalized Laplacian. 
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3 A Proof Exemplar: Regular Graphs 



In this section, we show that the eigenvalues of C on regular graphs satisfy A& = U(k 2 /n 2 ) for 
k > 2. The proof exhibits our techniques in a simple case without requiring much of the notation 
that will be needed later. 

Theorem 3.1. For any connected, unweighted, regular graph G and any k > 1, 

A > (fc ~ 1)2 
k ~ WOn 2 

An extension of the above theorem to infinite graphs is given in Theorem 5.10. The proof applies 
as well to finite graphs. Since the statement and proof work for both finite and infinite graphs, 
they require a different notation. It is instructive to see how this proof can written in a different 
notation for the spectral embedding. 

Let d denote the degree of G. We start by defining the spectral embedding of graphs, which is 
our main technical tool (the general definition can be found in Subsection 4.4). Let g\, c/2> • • • , 9k 
be orthonormal eigenvectors for the lowest k eigenvalues of C, and let fi := D^^gi = gi/Vd for 
all 1 < i < k. The spectral embedding of G is defined as follows: 

F(x) := {f 2 (x)J 3 (x),...,f k (x)). 

The spectral embedding satisfies the following three properties that will be essential in our proofs. 
Each property statement is followed by its proof. 

Isotropy: For every unit vector v G R fc_1 , ^ d(v, F(x)} 2 = 1: 

d(v, F(x)) 2 = v T ( dF(x)F(x) T J v = v T /v = ||v|| 2 = 1 . 

X \ X / 

Average Norm: ± £ x d \\F(x)\\ 2 = 

Add up the isotropy equations over the k — 1 vectors v of an orthonormal basis. 
Energy: For every unit vector v G R^ 1 and / G i 2 {V) defined by f(x) := (v, F(x)), 

En/^)-/(y)ii 2 = Ra yG(/)<^ : 

The equality follows since by isotropy, d ||/|| 2 = 1, and the inequality follows by the variational 
principle and the fact that / G span{/2, . . . , fk}- 

We next give an overview of the proof. The idea is to choose a vertex x far from the origin in the 
spectral embedding, i.e., ||F(x)|| 2 = 0(/c/(nd)). We consider a ball B C of radius ||F(x)|| /2 

centered at F{x). We bound below by lower-bounding the energy of a function / = (v,F) with 
f(x) = ||F(x)|| along the shortest path from x to the vertices outside of B. To obtain a good lower 
bound, we need to show that the length r of this path is 0(n/{kd)). We use the regularity of the 
graph to show that the shortest path has length 0(jB\/dj. Then we use the isotropy property to 
show that \B\ = 0{n/k). Together, these give the bound we want on the length of the path. Using 
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the starting value of /, we obtain that = il(-F(x) 2 /r) = f)(A; 2 /n 2 ), which completes the proof 
of the theorem. 

We now begin the actual proof. By the average-norm property, there is a vertex x £ V such 
that ||F(x)|| 2 > We define / G £ 2 {V) by 



In particular, observe that f(x) = ||F(a;)||. By the energy property, > Ray G (/), so it suffices to 
show that Ray G (/) = fl(f{x) 2 kd/n) = fl(k 2 /n 2 ). 
Let 

B:={y:\f(y)-f(x)\<\f(x)\/2}. 

First, by the isotropy property, 

1 = E d ■ f(y) 2 * E d /(*) 2 /4 = \B\d \\F{x)\\ 2 /4 > m \~ l) ■ (3.1) 

y&V yeB 

Second, we show that B 7^ V. Since / G span{/2, . . . , we have (/, f\) = 0, i.e., ^2 y f(y) = 0. 
But since f(y) > for each y G B, we must have B 7^ V. Since \B\ 7^ V and G is connected, there 
is a path from x to outside of B. Let "P = (yo, yi, ■ ■ ■ , y r ) be the shortest path from x to any vertex 
outside of B. Thus yo = x, y r ^ B, and the rest of the vertices are in B. 

We consider two cases. First, if | nbd(x) n B\ < d/2, then by the energy property, 

A fc >Ei/(y)-/(-)i 2 > E i/w-/(y)i 2 >^( ll^)|| /2) 2 >^^ 

y^z y^B:y~x 

and we are done. Otherwise, we can show that \B\ > dr/6. This is because if r = 1, the fact that 
I nbd(x) n B\ > d/2 implies \B\ > d/2. If r > 1, then since G is regular, \B\ > d(r — l)/3 > dr/6 
by Lemma 5.6. Therefore, by the energy property, 



r-l /r-1 \ 2 

A fc > El^)-^+i)! 2 ^- E[/(f*)-/(l/i+i)] =-(/(x)-/(y r )) 

i=0 r \i=0 / r 

> ||F(x)f > d\\F(x)\\ 2 > (^-l) 2 
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4r ~ 24|J3| ~ 96n 2 ' 

where the second inequality follows by the Cauchy-Schwarz inequality, the third inequality follows 
by the fact that y r £ B, and the last inequality follows by (3.1). This proves Theorem 3.1. 



4 Spectral Measure and Spectral Embedding 

4.1 Spectral Measure 

The spectral theory of the Laplacian generalizes naturally to infinite graphs. However, eigenvalues 
may not exist. Instead, one defines a probability measure on the spectrum; in the finite-graph case, 
this amounts to assigning equal weight to each eigenvalue. Next, we describe the spectral theory 
of infinite locally finite graphs that may be equivalently applied to finite graphs. 
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We briefly review the spectral theorem for bounded self-adjoint operators T on a complex 
Hilbert space 7~L. For more details, see, e.g., [Rud91, Chap. 12]. Let 8$ be the Borel cr-field in R. 
A resolution of the identity I(-) is a map from SS to the space of orthogonal projections on 
T~L that satisfies properties similar to a probability measure, namely, 1(0) = 0; /(R) = for all 
Bi,B 2 S SB, wehave/( J Bin5 2 ) = I(B 1 )I(B 2 ) and, if Bi(~)B 2 = 0, then I(B X \JB 2 ) = I(Bi)+I(B 2 ); 
and for all /, g £ H, the map B \— > (1(B) f,g) is a finite complex measure on £$. Note that 
B i — y (1(B) f,f) = ||/(i3)/|| 2 is a positive measure of norm ||/|| 2 . The spectrum of T is the set of 
A E R such that T — XI does not have an inverse on %. The spectral theorem says that there is a 
unique resolution of the identity, It(-), such that T = J XcIIt(X) in the sense that for all f,g£H, 
we have (Tf,g) = j Xd(lT(X)f,g). Furthermore, It is supported on the spectrum of T. For a 
bounded Borel-measurable function h: R — > R, one can define (via what is called the symbolic 
calculus) a bounded self-adjoint operator h(T) by h(T) := J h(X) dlT(X), with integration meant 
in the same sense as above. The operator h(T) commutes with T. For example, 1b(T) = It(B). 
In our case, T will be positive semi-definite, whence its spectrum will be contained in R + . In this 
case, we will write It(S) '■= It ([0,5]) for its cumulative distribution function (5 > 0). 

For example, if % = L 2 (X,u) and g € L°°(X,ij,), then the multiplication operator M g defined 
by M g : f — )■ g ■ f is a bounded linear transformation, which is self-adjoint when g is real. In this 
case, I Mg (B) = M lg _ 1[B] and h(M g ) = M hog . 

If T~L is finite dimensional and T has spectrum a, one could alternatively write It{B) = 
Ylx&anB where P\ is the orthogonal projection onto the A-eigenspace. In particular, It(S) = 
^2x<s Writing T = ^Aeo- amoun ts to diagonalizing T. Here we have h(T) = J2\ea ^W^x 
for any function h; because only finitely many values of h are used, we may take h to be a polyno- 
mial. 

Let G be a locally finite graph. Let Ic(') be the resolution of the identity for the operator 
C. The Laplacian £ is a positive semi-definite self-adjoint operator acting on £ 2 (V) with operator 
norm at most 2, so its spectrum is contained in [0, 2]. We may use /(•) whenever the operator is 
clear from context. For a vertex x £ V and 5 > 0, the function 



is called the vertex spectral measure of x. It defines a probability measure on the Borel sets of 
R supported on [0, 2]. If G is finite, then the spectral measure of G is defined as 



For general infinite graphs, there is no corresponding spectral measure, other than the projection- 
valued Ic- Of course, if G is transitive, then /i x (5) does not depend on x G V, and in this case, [i x 
is already an analogue of [i. 

For infinite graphs with infinite volume, there is no kernel of C, so 1(0) = 0. However, for finite 
graphs, 1(0) is the projection on the kernel of C, which is the space of multiples of ^Jlx. Since 
we are not interested in the kernel of C, it will be convenient for us to work with the operator 
I* (5) := 1(5) — 1(0). Correspondingly, we define 



a x (5) := (I c (6)l x , l x 



(4.1) 




(4.2) 



x&V 
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Observe that n*(5) = (i x (5) — ir(x). Therefore, for every connected graph G and every vertex x G V, 
we have /U*(0) = and a*(2) = 1 — n(x). Furthermore, n*(5) = a(8) — l/n when G is finite. 

Example 4.1. Consider the unweighted cycle on n vertices, which we regard as the usual Cayley 
graph of Z n := Z/nZ. The Fourier transform T maps £ 2 (Z n ) isometrically isomorphically to 
£ 2 (JL n ,v), where v is the uniform probability measure on Z n; and carries C to the multiplication 
operator M g , where g(k) := 1 — cos(2-7r/c/n). The eigenvalues of C are the values (with multiplicity) 
of g. Since C = T~ x M g T, we have that Ic = T~ x lM g ^F- Clearly, /m 9 [0, A] = M\ B ^, where 
B\ := {k : g(k) < A}. Therefore, u x (5) = \B$\/n for all x G Z n . 

Example 4.2. Consider the usual unweighted Cayley graph ofL. The Fourier transform P maps 
L 2 (IR/Z) isometrically isomorphically to £ 2 (Z) and carries the multiplication operator M g to C, 
where g(s) := 1 — cos(27rs). Since C = FM g F~ l , we have that Ic = TIm 3 ^F~ X ■ Clearly, 7m 9 [0, A] = 
M\ B ^, where B\ := {s : g(s) < A}. Therefore, n x {5) = \B$\ for all x G Z. 

Since both 1(5) and I* (5) are functions of C, it follows that they commute with C 

Fact 4.3. For every graph G, and 5 G (0,2), 1(5) and I* (5) commute with C, i.e., I(5)C = CI(5), 
and I*(5)C = CP (5). 

It is straightforward that characterizing spectral measure of finite graphs provides a correspond- 
ing characterization for the eigenvalues of the normalized Laplacian. 

Fact 4.4. For every finite graph G and 2 < k < n, if /j,*(5) < (k — l)/n, then A& > 5. 

The next lemma is a generalization of the Rayleigh quotient to infinite graphs. 

Lemma 4.5. Let f G f(V) and 5 G [0,2]. If D 1 / 2 f G img (1(8)), then 

(LfJ)<5(fJ) w = 5Y J w(x)f(x) 2 . 

x&V 

Proof. Let g = D x l 2 f. Since g G img (l(5)), for every 82 > 5\ > 5, we have (1(82) — I(5±))g = 0. 
Therefore, 

(Lf, f) = (Cg, g)= f A d(I(X)g, g) = I A d(I(X)g, g) + [ X d(I(X)g, g) 

J [0,2] J [0,5] J (5,2] 

<S(g,g)=5(f,f) w . □ 

4.2 Random Walks 

We shall consider lazy random walks on G, where from each vertex x, with probability 1/2 
we stay at x, and with probability w(x,y)/(2w(x)) we jump to the neighbor y of x. We write 
P := ^(1 + D~ 1 A) for the transition probability operator of the lazy random walk on G. 

An alternative is continuous-time random walk, which has the transition matrix D~ l A, but 
rather than take steps at each positive integer time, it takes steps at the times of a Poisson process 
with rate 1. In other words, the times between steps are IID random variables with Exponential 1) 
distribution. These random walks behave very similarly to the discrete-time lazy random walks, 
but they move to new vertices twice as fast on average. The mathematics is slightly cleaner for 
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continuous time than for discrete time; sometimes one can derive bounds for one from bounds for 
the other, but often it is easier simply to follow the same proof. 

It is easy to see that for every finite graph G, the kth largest eigenvalue of P is equal to 1 minus 
half of the fcth smallest eigenvalue of C. That is, the eigenvalues of P are 

1 = 1- Ai/2 > 1 - A 2 /2 > . . . > 1 - A n /2 > . 

For two vertices x, y G V, we use pt(x,y) to denote the probability that the discrete-time 
random walk started at x goes to y in exactly t steps. Observe that pt(x,y) = (P t l y ,l x ). For a 
finite, connected graph G, let ir(-) be the stationary distribution of the walk. It is elementary that 
ir(x) = w(x) / wt(V) for all x G V . For every p > and e > 0, the LP -mixing time of the walk is 
defined as 

l/p 

Pt(x,y) 



mm < 




1 



7r(y) 



< e 



For p = oo, one defines 

T^o(e) := min ^ / : V.r. ,y G V 
It is elementary that for every e > 0, 



Pt(x,y) 



< e 



rroo(e)/2l = r 2 (Vi) = min (t : Vx G V P2t{ -^f> < 1 + e) 



(4.3) 



We present a self-contained proof in Proposition A.l. 

We use qt(x,y) for the probability that the continuous-time random walk started at x is at y 
at time t. We have 



qt(x,y) = (e 



-tc 



-Af 



d(I £ (A)l yj l a 



(4.4) 



One defines L p -mixing times for continuous-time random walks in the same way as for discrete-time 
random walks. In this case, (4.3) holds without the ceiling signs. 

We can use the spectral measure of the Laplacian to upper bound the return probability, or the 
mixing time, of the random walks. Recall that when G is infinite, tt(x) := 0. 

Lemma 4.6. Let G be a weighted graph. For every vertex x G V and t > 0, we have 

Pt (x, x) = tt(x) + l - J\l - A/2)*-V*(A) d\ . 
Iffi*(X) < tp(X) for some increasing continuously differentiable function if] with i/j(0) = 0, then 



Pt(x,x) < ir(x) + / (1 - A/2) V(A) dX < ir(x) + 
Hence if G is finite, then 



-At/2,/./ 



■0 (A) dX. 



Y, x Pt{x,x) - 1 _ £ 
n 2 



(1- X/2y- l n*{X)dX 
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and if /U*(A) < for some increasing continuously differ entiable function ip with V'(O) = 0, then 
J2 x Pt(x,x)-_L^ [\ 1 _ X/2 y^(X)d\<7r(x)+ F e- xt l 2 ^'(\)d\. 



n JO JO 

Proof First, since P = £) -1 / 2 (J - C/2)D 1 / 2 , we have 

pt(x,x) = (P*l x ,l*) = (D-V^I-C/ZfD 1 '*!^) = {{I-C/ifD^l^D- 1 ' 2 !, 

= ((/ - £72)*!^, l x ) . 



Symbolic calculus gives 



Therefore, by (4.1), we get 



(/-£/2)*= / 2 (l-A/2)*d/ £ (A). 
Jo 



pt(x,x)= [ (l-A/2) < d(/ £ (A)l a; ,l a; ) =tt(x)+ [ (1-A/2)'^:.(A) 

JO JO 

= n(x)+ t -J o (l-A/2)*-V;(A)dA, 

where the third equation holds by the fact that //*(0) = 0. Therefore, if /^(A) < ?/>(A) and ip(-) is 
continuously differ entiable, 

Ptfa x) = ir(x) + l - f (1 - A/2) 4 " V* (A) rfA < vr(x) + £ Al - A/2)*"V(A) dA 

^ JO ^ JO 

r-2 



vr(x)+ / (1 - A/2)V(A)dA. □ 

JO 



For continuous-time random walk, (4.4) tells us that the return probability is given by the 
Laplace transform, i.e., 

f 2 

qt(x,x) — ir(x) = / e~ Ai d/i* (A) . 
Jo 

This makes formulas somewhat cleaner. But as we used in the preceding proof, pt(x, x) < qt/2(x, x). 

An upper bound on spectral measure gives an upper bound on return probabilities, as in 
Lemma 4.6. The reverse is true as well, as noted by [ES89, Proposition 5.3]: 

Lemma 4.7. Let G be a weighted graph. For every vertex x £ V , we have 

/4(<5) < 2e(p [2/si (x,x) - vr(x)) for < 6 < 1 

and 

/4(<5) <e(q 1/s (x,x)-ir(x)) for 0<8<2. 
See the appendix for a proof. 

See also [GS91, Appendix 1] for a comparison of the asymptotics of n%{6) for small 5 with the 
asymptotics of pt(x, x) — ir(x) for large t. In many commons situations, each asymptotic determines 
the other. 

We shall generally state our results only for discrete-time random walks, but analogous results 
follow from similar proofs for continuous time. 
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4.3 Embeddings of Graphs 

We start by describing general properties of every embedding of a graph G into a (real or complex) 
Hilbert space H. Let F: V — > rl be an embedding of G. (Note that by "embedding", we do 
not imply that F is injective; it is merely a map.) We say that F is centered if G is finite and 
Szev F(x)w(x) = 0. We also say that F is non-trivial if F{x) 7^ for some x G V. For a vertex 
x G V and radius r > 0, we use Bp{x,r) := {y G V : ||.F(x) — < ^} to denote the set of 

vertices of G contained in a ball of %-radius r about x. 

For a subset E' of edges of G, we define the energy of F on E' as 

£ F (tf):= E ^,y)ll^)-^(y)f • (4-5) 

Roughly speaking, the energy of a subgraph of G describes the stretch of the edges of that subgraph 
under the embedding F. For a set S C V of vertices, we define the energy of S as Sp(S) := 
£f(E n (5 x S 1 )). If we use the weight of an edge as its conductance, then we can relate energies 
to effective resistances of the corresponding electrical network: For a finite graph G, we define the 
effective conductance between a pair of vertices s,t G V as 

„ , . . £ f (E) . E x ~yHx,y)\f(x)-f(y)\ 2 

C e ff(s,c) := mm — = mm 



/e*(v) 1/(5) -/Wl 2 |/(a)-/(*)l 2 
/(«¥/(*) /(«)*/(*) 

This is for a scalar-valued function /, but by adding the squares of coordinates, the same holds for 
vector-valued functions F in place of /. The effective resistance TZ e s(s, t) is the reciprocal of the 
effective conductance. We also use • — sup s 72. e g(s, t) to denote the maximum effective 

resistance of any pair of vertices of V, the effective resistance diameter of G. Similarly, define 
7?. c iiam(' s ) := max£ 7£ e ff(s, t). It is well known that the expected time for the non-lazy random walk 
to go from s to t and then back to s is equal to \Nt(V)TZ e g(s,t); this is called the commute time 
between s and t. The maximum commute time between x and any other vertex will be denoted 
:= \Nt(y)7Zdia,m(x) , while the maximum commute time between any pair of vertices will be 
denoted := wt(V)7£diam- Note that these standard formulas are all for the non-lazy random 
walk; an additional factor of 2 would apply for the lazy random walk. We refer to [LP 13, Chap. 2] 
for more background on electrical networks. 

The following lemmas are used in several of our proofs. 

Lemma 4.8. For every non-trivial centered embedding F: V — > rl of a finite graph G, we have 
B F {x,\\F{x)\\)^V. 

Proof. If F{x) = 0, the statement follows since F is a non-trivial embedding. So assume ||.F(x)|| 7^ 0. 
For the sake of contradiction, suppose Bp{x, [|jF(x)|| ) = V. Then for every vertex y G V, we have 
(F(x),F(y)) > 0. Since F is centered, we have 

0=X: w(y)(F(x),F(y)) > w(x)(F(x), F(x)) > , 
xev 

a contradiction. □ 
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Lemma 4.9. Suppose that w(x,y) > 1 for all edges (x,y) G E. Let F: V — > H be any embedding 
of G into a Hilbert space %, and let B := Bp(x,r). If V C E is a path in G from x to a vertex 
outside of B, then 

^ V)> -W\-\BY 

Proof. Let V = (2/0,2/1,2/2, • • • , 2/Z-i,2/j), where yo = x and yi ^ B. Then by the Cauchy-Schwarz 
inequality, we have 



£f(V) > £ \\F{vi) - F(y l+1 )\\ 2 > y ~ F (yi+l)\\) > J \\F(yo) ~ F( yi )\\ 2 > 

i=0 \i=0 / 



where the first inequality uses the assumption that w(x,y) > 1 for all edges (x,y) G E, the third 
inequality follows by the triangle inequality in Hilbert space, and the last inequality follows by the 
assumption that yi ^ B. □ 

4.4 Spectral Embedding 

For 8 G (0, 2), we define the spectral embedding F: V — > £ 2 (V) of G by 

F(x) := I*(5)e x = —L=r(5)l x . 

\Jw{x) 

For finite graphs, another way to view this embedding is as follows. Suppose that 8 = ^ ^k+i 
and that gi, g2, ■ ■ ■ , 9n- V — > K is an orthonormal basis of £ 2 (V) such that g2, . . . , gu span the image 
of I* (8). For example, gj could be a A j -eigenvector of C. Because F(x) lies in img (l*(<5)), we may 
write F{x) in the (gi, . . . , <? n )-coordinates as 

F(x) = (0,/ 2 (x),...,/ fc (aO,0,...,0). 

In order to calculate fj(x), we write 



y/w(x) yjw{x) yjw(x) \J w{x) 

One could, therefore, work simply with (fi(x), . . . , fk( x )), and translate all our proofs for finite 
graphs into such language. This was done in Section 3 for finite regular graphs as an illustration. 
For infinite graphs, one could use infinitely many vectors fj, but they would not be eigenvectors. 

Lemma 4.10. For every finite graph G, the above F is centered. 

Proof. First observe that CD 1 / 2 ! = 0. Since I* (8) projects into the orthocomplement of the kernel 
of C, we get P(8)D l / 2 l = 0, whence 

Y^w[x)F{x) = \/Mx)I*{8)l x = I*{8)D l ' 2 l = 0. □ 
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Lemma 4.11. For every finite or infinite graph G and every vertex x G V, 

w(x) \\F(x)f = yJw[x)F(x)[x) = n%{8) . 

Hence, for every finite graph G, 

Y J \\F(x)\\ 2 w(x) = Y J »m = n^(S). 

Proof. The proof simply follows by the definition of spectral embedding: 

(P(5)l x ,lx) _ /4(«5) 



and 



F{x)(x) = (I*(5)e x ,l x ) = 
\F(x)\\ 2 = (I*(6)e x ,I*(6)e x ) 



\/w(x) \/w(x) 
(I*(S)1 X ,I*(S)1 X ) _n* x (6) 



□ 



w(x) w(x) 

For a set S C V, let n* s (5) := Y^xeS Pxffl' ^ ne P ro °f °f the next lemma is based on [LOT12, 
Lemma 3.2]. 

Lemma 4.12. For every S G (0,2), the spectral embedding F enjoys the following properties: 

i) For every unit vector f G img (/*(£)), we have YlxeV w ( x )\ (/> F( x )) f = 1- 

ii) For every vertex x G V and r := a ||F(a;)|| with a > 0, we have 

mW)(*)< (i-L 2 ) 2 ' 

Proof. First we prove (i): 

^ w (x)|(/,f(x))| 2 = ^^K/,/*^)! 2 = £ |(r(5)/,u| 2 = i, 



where the last equality follows by the fact that ||/|| = 1 and / G img (/*(#)). It remains to prove 
(ii). First observe that for every two non-zero vectors /, g G -£ 2 (y), we have 



/ 5 










11/11 llsll 




llfll 



< 



9 ~ Tr-n-5 
5 



<2||/- 5 | 



Therefore, 



ft 



/ 5 



i- 


/ 5 




11/11 Nl 



2 ) >1 _ 2 \\f-9\ 



Now let / := F(x)/ ||-F(x)||. Since / G img (l*(<5)), we have by (i) that 

F(y) F{x) ' 2 



1 = J>( y )|(F( y ),/>| 2 > £ «;(y) 11^)11 

y&V veB F (x,r) 

/ \ 2 

|F(x) - if'-" - 



ydBp{x,r) \ 



\F(yW \\F(x)\\ 



||F(x)|| : 



□ 
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Lemma 4.13. For every finite graph G and 5 G (0,2), 



Sf(V) 
EJF(x)\\ 2 w(x) 



6 > _ n -^;/ 2 = Ray(F) . 



Proof. For every vertex x G V, define / x G £ (V) by := (F(y), l x ) for y G V. Then 



MXO _ E„~,^*)E*|<^)-n*)>i:»>| 2 _ E^(y>*)EJ/*(?/)-/*(z 



2 



E^(x)||F(x)|| 2 " E^'C^l^M*) Ea^WWy} 2 

where the last inequality holds by Lemma 4.5 and that D 1 / 2 f x G img □ 

Example 4.14. Consider again Example 4-1 of the unweighted cycle on n vertices, which we regard 
as the usual Cayley graph of7L n := Z/nZ. Choose 5 := A2 = 1 — cos(2-7r/n). We may calculate the 
embedding F:7L n ^r £ 2 (Z n ) by identifying l x G ^ 2 (Z n ) with its image e x : k 1— )■ e 2mxk ^ n under the 
Fourier transform. Then F(x): k 1— > l{±iy(k)e x /V2. The image of F is a set of n points equally 
spaced on a circle. 

Example 4.15. Consider again Example 4- 2 of the usual unweighted Cayley graph of Z. The 
embedding F:7L^? £ 2 (Z) is easiest to perceive if we identify £ 2 (Z,) with L 2 (1R/Z) (via the Fourier 
transform). Then F(x) = lB 6 e x /\/2, where e x : s 1— > e 2mxs . These points are on an infinite- 
dimensional sphere, with the inner product between F(x) and F(y) being sin ((x — y)cos _1 (l — 
5)) /(2vr(x - yj) for x ^ y and < 5 < 2. 



5 Bounds on the Vertex Spectral Measure 

Let G be a locally finite graph. This section contains two subsections. In the first, we treat worst- 
case finite graphs for eigenvalues, spectral measure, return probabilities, and mixing. In the second 
subsection, we treat the worst case graphs when a lower bound to the growth rate is imposed. Both 
sections have results for regular graphs. 

The structure of all our proofs follows the same two steps. In the first step, we bound eigenvalues 
from below by the Rayleigh quotient of a specially-chosen function in the image of a spectral 
embedding. In the second step, we bound the Rayleigh quotient from below via a geometric 
argument. The geometry will not enter in a serious way until the proof of Proposition 5.8. In 
general, when we bound the spectral measure at a vertex x, the embedding will place x at a 

location whose distance from the origin is related to fJ, x (8). The energy of the embedding is then 
bounded below by some version of the fact that other "close" vertices are embedded "far" from the 
location of x. This fact, in turn, arises from the property that the embedding is orthogonal to the 
kernel of C The meaning of "far" depends on the assumptions of the theorem desired. 



5.1 Worst-Case Finite Graphs 

We begin with a very simple proof of a lower bound on A2. It shows that the relaxation time 
(i.e., I/A2) is bounded by half the maximum commute time. This is well known; later we shall 
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improve it to show that the L°°-mixing time is bounded by a constant times the maximum commute 
time. In this proof, the first step (in the general structure of our proofs) is trivial by choosing an 
eigenfunction, and the second step is quite general. 

Proposition 5.1. For every finite connected weighted graph G, we have 

2 

A 2 >— • 



In particular, if G is unweighted, then 



A 2 > 



n(n — 1) 



2 



Proof. Let g be a unit-norm eigenvector of C. Write / := D 1 ' 2 g. Then 

\2 = £f{E)>\f{x)-f{y)\ 2 C eS {x,y) 
for all x y. Since g _L y/w, we obtain 



K diam \ 2 >^ f w(x)w^)\f(x) - /(y)| 2 /wt(F) 2 = —— 2 = — - 

x,y ^ ' ^ ' 

as desired. In the unweighted case, we use the fact that wt(V) < n(n—l) and T^diam ^ diam < n — 1 
(as in Lemma 4.9). □ 

The maximum commute time is known (see [CFS96] for a simple proof) to be at most 4n 3 /27 + 
o(n 3 ) if G is unweighted, whence 

27 + (l) 

A2 -^3— ' (5 ' 1} 

As is well known, this bound is sharp in various ways up to a constant factor. For example, 
[L081] show that the barbell graph, which has [n/3\ vertices in each of two cliques and n — 2[re/3j 
vertices in a path that joins the two cliques, has A2 < 54/n 3 + 0(l/n 4 ). 

One can regard the preceding proof as using the 1-dimensional embedding / : V — > R. In the 
rest of the paper, we use higher-dimensional embeddings F to bound the spectral measure at a 
vertex. However, in this section we still use only a 1-dimensional relative of F, whereas later 
sections depend crucially on using the full F. 

Proposition 5.2. For every finite, connected graph G with w(x,y) > 1 for all edges (x,y), we 
have for every vertex x £ V, 

/4(<5) + < Tld\aia{x)5w(x) < (n - l)5w(x) . 

Therefore 

1*1* (5) + 1/n < lZdia.mU>0~ < {n — l)w5 

and 

k k 
A fc > — > 



C ~ in- l)wt(y) 
where w := wt(V)/n. 
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Proof. Recall that F(x) := I*(5)e x . Define /: V -> R by 




In the next claim we describe some of the properties of the 1-dimensional embedding /. 
Claim 5.3. The function f defined above satisfies 

i) ll/L = l, 

U) f( x ) = ^Jn* x {5)/w{x), 

in) D^f = thus D^f e img (/*(£)). 

Proof. First, since F(x) £ img (/*(#)), by Lemma 4.12, \\f\\ w = 1. Second, by Lemma 4.11 we have 
/0) = ||F(x)|| = y/ /j,l(5) /w(x). Third, by the definition of /, D l l 2 f = Jpgtr. This is because for 
every y, f(y) = (p^,^). □ 

By (i) and (iii) above and Lemma 4.5, we have for each y £ V, 

5 > (L/, /) = £/(£) > - /(y)| 2 /^diam(x) • 

Therefore, 

^diam(x) > 5>(y)|/(z) -/(y)| 2 /wt(y) 

= /(I)2 + SEJvj - ^y) '<*> S - W/ W = / W 2 + s^j 

since D 1 / 2 f _L ^/to. Use of (ii) above now gives the first inequality, n%(5) + tt(x) < , R.^ ajUL (x)8w{x) . 

Furthermore, since w(y, z) > 1 for all adjacent pairs of vertices, the conductance of each edge is 
at least 1. Therefore, since G is connected, the effective resistance of each pair of vertices is at most 
n — \ (as in Lemma 4.9). Hence, lZd\a,m ( x ) <n — l. This completes the proof of Proposition 5.2. □ 

It is known that the L°°-mixmg time is bounded by the maximum hitting time (see the middle 
display on p. 137 of [LPW06]), which, in turn, is at most the maximum commute time. More 
precisely, [LPW06] shows that 

Pt(x,x) _ 1 < J2 y ^{y)^y[Tx] 
tt(x) ~ t 

where T x is the first time the lazy random walk visits x. This result is due to Aldous. We give 
another proof here that the L°°-mixing time is bounded by the commute time, which we use to 
answer open questions on the smallest log-Sobolev and entropy constants. 
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Corollary 5.4. For every unweighted, finite, connected graph G, we have 

Too(l/4) < 2[8|^|^ diam ] <8n 3 . 
More generally, if G is a finite connected weighted graph, then for all x £ V(G) and for all t > 1, 



whence 



Proof. By Lemma 4.6, we have 



Pt(x,x) _ 1 2t| 

7r(x) t 

r 00 (l/4)<2[4C 



(5.2) 



Pt(x,x) 
tt(x) 



1 < 



1 



7T X 



V - X/2)\n^ m (x)w(x)X)'d\ = ^ li - ( ''' ) ' r( - r ) 



7r (a;) 



(1- A/2)*dA 



C l\l-\/2) t d\<t^ f 
Jo Jo 



- xt ' 2 dX 



2tL 



If i := 2|"4ttw] > then this is at most 1/4, whence Too(l/4) < i by (4.3). In the unweighted case, we 
use the fact that wt(F) = 2\E\. □ 

As is well known, the barbell graph has f2(n 3 ) L 1 -mixing time. (This follows from the bound 
on A 2 of [L081] and, say, [LPW06, Theorem 12.4].) 

The 5th open question in [MT06] asks how small the log-Sobolev and entropy constants can be 
for an n- vertex unweighted connected graph. We can now answer this (up to constant factors). We 
first recall the definitions. Define Ent 7r (/) := (f,log(f/(f,ir))) n . The entropy constant is 



Po(G) := inf 



<Wog/> 



/ EnW / 

where the infimum is over /: V — > (0, oo) with Ent^ / ^ 0. The log-Sobolev constant is 

(CfJ) 



where the infimum is over / : V 



and [MT06, Theorem 4.13] that 



p(G) := inf ■ 

/ EnW/ 2 

with Ent^ f 2 ^ 0. It is known [MT06, Remark 1.11] that 
4p < po < 2A 2 



2p > l/r 2 (l/e) . 

In the latter case, continuous-time random walk is used. As noted in [MT06], the first of these 
inequalities implies that mine p(G) = 0(n~ 3 ) and mine po(G) = 0(n -3 ) because of the example 
of the barbell graph cited earlier, where the minima are over n-vertex unweighted graphs. On the 
other side, the continuous-time analogue of (5.2), namely, 



Qt(x,x) 

7I"(x) 



1 < 



yields that T2(l/e) < e 2 n 3 /2, which combined with the second inequality above gives p > l/(e 2 ra 3 ) 
and po > l/(e 2 n 3 ). Thus, we have proved the following: 
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Theorem 5.5. For finite unweighted graphs G with n vertices, we have 

min /9(G) = 0(n -3 ) and min po(G) = 0(n -3 ) . 

G G 

For regular unweighted graphs, we may reduce the mixing bound 0(n 3 ) of Corollary 5.4 to 
0(n 2 ). To see this, we use the following well-known bound on growth of regular graphs. Bounds 
on the diameter of regular graphs go back to [Moo65], but he uses a different approach. 

Lemma 5.6. For every unweighted, connected, d-regular graph G, x G V and 1 < r < diam(x), 
we have wt(x,r) > d 2 r/3. In particular, diam(:r) < 2>n/d. 

Proof. Let B = B^ ist (x,r). Choose y G B such that dist(x,y) = r. Let V = (yo,yi, ■ ■ ■ ,y r ) be a 
shortest path from x to y. Let S = {yo, y^, ye, ... , y 3 \ ( T -_i)/3j }. Since V is a shortest path from x to 
y, no vertex of S is adjacent to any other vertex of S, and no pair of vertices of S have any common 
neighbors. Moreover, since for each z £ S, dist(x,z) < r, each vertex of S is adjacent only to the 
vertices inside B. Therefore, since G is d-regular, every vertex of S has d — 2 unique neighbors in 
B\V that are not adjacent to any other vertices of S. Hence 

| B |>|P| + |S|(d-2)>( r+ l) + <^>M±il^. 

Since G is (f-regular, we get wt(B) = wt(x,r) > d 2 r/3. □ 
Corollary 5.7. For every unweighted, finite, connected regular graph G, we have 

Too (1/4) < 24n 2 . 

Proof. Let d be the degree of G. Since \E\ = nd/2 and T^diam < diam < 3n/d, the inequality is 
immediate from (5.2). □ 

As is well known [MT06, Example 2.11], ^(1/4) = G(n 2 ) for a cycle on n vertices. 
We remark that the same bound as in Corollary 5.7 holds with an extra factor of the maximum 
degree over the minimum degree for general finite unweighted graphs. 



5.2 Volume Growth Conditions 

We now prove stronger bounds that depend on lower bounds for volume growth. The proof has 
some similarity with that of [BCG01, Lemma 2.4]. 

Proposition 5.8. For every finite or infinite graph G with w(x,y) > 1 for all edges (x,y), and 
every vertex x G V, 5 G (0, 2), a G (0, 1), 

w „. 5w(x) , , . w(x) 

/4 5 <— j^r when wtx,r )>- ^— - . 5.3 

a (1 - aYn*{5) 



Thus, 



/4(*)<-^\ for 6<— * (5.4) 
\Nt(x,r) r\Nt(x,r) 
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Proof. Let / be as defined in Proposition 5.2, and let B := Bf(x,af(x)j. If G is finite, then / is 
centered, so there exists a vertex outside of B by Lemma 4.8. If G is infinite, then there exists a 
vertex outside of B by Claim 5.3 (i). Let V be a shortest path from x to a vertex outside of B (since 
G is connected, V is well defined). Since D 1 / 2 f G img (/*(<5)) by Claim 5.3, we have by Lemma 4.5 
that 

where the third inequality holds by Lemma 4.9. Let -B' := S^stC^i IT 3 ! — 1). By definition of V we 
have B' C 5. Since 

wt(B)(l - a) 2 / 2 (x) < £ / 2 (yMy) < £ f 2 (y)w(y) = ||/|| 2 = 1 , 

we obtain 

Wt(x, \V\ - 1) = Wt(fi') < Wt(B) < X9J9/ r = W^TTTT • ( 5 - 6 ) 

In (5.3), we have r > \V\. Therefore, (5.3) follows from (5.5). 

If we combine the two inequalities wt(x,r) > ^J^y^, ^ and /x*(<5) < ^^f^ r for q = 1/2, then 
we obtain that the first of them implies that r5 > l/wt(x,r). The converse of this is (5.4). □ 

For infinite graphs, this can be compared to [LPW06, Theorem 21.18] (due to [BCK05, Propo- 
sition 3.3]), a version of which can be stated as 

Pt(x,x)<^\ (5.7) 
wt(x, r) 

for t > r ■ wt(x,r). This result implies (5.4) with "4" replaced by "6e" via Lemma 4.7. 
Next we describe some of the straightforward corollaries of the above theorem: 

Corollary 5.9. For every finite or infinite connected graph G with w(x,y) > 1 for all edges (x,y) 
and every x £ V and 5 S (0, 2), 

fJ>x{d) — max {w(x)Vl25, 2w(x)/ diam(x)} . 

Proof. Since w(y,z) > 1 for all adjacent pair of vertices, for any path V of length r, we have 
wt('P) > 2r. Thus, wt(x,r) > 2r. Therefore, by Proposition 5.8, for a = 1/2 and r = [^Tpy], we 
get, provided that r < diam(x), 

2w(x) 



Hl(6) < A5w{x){r + 1) < A5w{x) 



, I ^ I25w 2 (x) 



where the last inequality holds by the fact that w(x) > 1 and /u* (<5) < 1. If, on the other hand, 

2w(x) 



r > diam(x) + 1, then > diam(x), which completes the proof. □ 



For regular unweighted graphs, we can remove the dependence above on w(x). It appears that 
this result is new. 
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Theorem 5.10. For every unweighted, connected, regular graph G and every x £ V, we have 
H%{5) < 10VS. Hence, n*(6) < 10\/<5. If G is finite, for 2 < k < n, we get 

A > {k ~ 1)2 



lOOra 2 

For all t > and x £ V , we have 

Pt(x,x) - 7r(x) < . 

Proof. Let / be as defined in Proposition 5.2 and 5 := Bf(x,af(x)} for a = 1/2, and let P be a 
shortest path from x to the outside of B. Write d for the degrees of the vertices of G. We want 
to show that wt(-B) > Q(d 2 \V\), and then the proof that fi*(5) < 10^/5 follows by equations (5.5) 
and (5.6). Unfortunately, the former may not hold in the case \V\ = 1. Suppose that \V\ = 1 and 
wt(B) < d 2 /2. Then, it must be that at least half of the neighbors of x are outside of B. Therefore, 

o > (Lf, f) > £f{B) > = — — , 

and we are done. 

So, if \V\ = 1 we may assume that wt(B) > d 2 \V\/2. 
If \V\ > 2, then by Lemma 5.6, 

vjt(B) > wt(x, \V\ - 1) > d 2 (\V\ - l)/3 > d 2 \V\/6 . 

Thus, we may assume the above equation holds for all \V\ > 1. Now by plugging this into (5.6), 
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we get \V\ < . 24 (s) . Finally, by (5.5) we obtain 



d\V\ ~ 24 100 

Since the above equation holds for every vertex x € V, it holds also for the spectral measure of G 
as well. Thus the inequality /J%.(5) < lOyS follows by an application of Fact 4.4. This inequality 
immediately implies the other two on the spectrum. 

Finally, the bound on return probabilities follows from Lemma 4.6: 

2 5 _ [2 f°° _ Q , 5^ 13 



p t (x,x)-n(x)< / e - xt / 2 ^d\<5J- / e ~ s s~ l l 2 ds = < ^= . □ 

Jo vA V t Jq y/t yt 

Again, we remark that the same bound holds with an extra factor of the maximum degree over 
the minimum degree for general unweighted graphs (or the reciprocal of this factor for the lower 
bound on Afe). 

Of course, the example of cycles shows that the bounds are sharp up to constants. 

We may also illustrate Proposition 5.8 by choosing common growth rates, as in the following 
two corollaries. The bound on return probabilities in the first corollary is the same as [BCG01, 
Example 2.1], except for the constant, which was left implicit in [BCG01]. (Note that all their 
results on graphs, including Theorem 2.1, require the hypothesis that w(x) be uniformly bounded. 
This was assumed in [Cou96, Proposition V.l] that they used.) The result is sharp up to a constant 
factor for every a > 1, even for unweighted graphs with bounded degree, as shown by [BCG01, 
Theorem 5.1] in combination with Lemma 4.7. 
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Corollary 5.11. Let G be an infinite graph with w(x,y) > 1 for all edges (x,y) and x E V. 
Suppose that c > and a > 1 are constants such that for all r > 0, we have wt(x,r) > c(r + l) a . 
Then for all 5 G (0,2), 

£(6) < Cw(x)6 a « a +V , 



where 

Hence for all t > 1, we have 
where 



(3/2) a /( Q+1 )(a + l) 2 
:_ c l/(a+l) a 2 

Pt(x,x) < C'w{x)t- a/{ - a+l) , 

= 3 a /( a + 1 )(a + l) / a 

C V(a+l) a ^ a + 1 



See the appendix for a proof. 

For example, we may always take c = a = 1, in which case we obtain the bounds < 
4y / 3/2w(x)5 and pt(x,x) < 2\^3ttw(x) / '^ft. For comparison, [Lyo05, Lemma 3.4] gives the slightly 
better bound pt(x, x) < 2w(x)/\/t + 1. 

Similarly, one can prove the following: 

Corollary 5.12. Let G be an infinite graph with w(x,y) > 1 for all edges (x,y) and x G V. 
Suppose that c\,C2,a > are constants such that for all r > 1, we have wt(x,r) > c\e C2ra . Then 
for all 5 € (0, min{2, cJ a /{2 Cl e)}) , 

/ l/a \ l l a 

Vm<8c- 1/a w(x)5hn^-\ . 



6 Bounds on Average Spectral Measure 

In the preceding section, we proved an 0(5) bound on /J,* (5), but the implicit constant depended on 
the graph. In the regular unweighted case, we obtained an 0(yS) bound with a universal constant. 
Here, we obtain an O^S 1 ^ 3 ) bound with a universal constant for all unweighted graphs. This answers 
a question of [Lyo05] (see (3.14) there) and has an application to estimating the number of spanning 
trees of finite graphs from information on neighborhood statistics; see below. No such bound on 
H%{5) for individual vertices x is valid, however. 

Theorem 6.1. For every finite, unweighted, connected graph G, and every 5 G (0,2), we have 
H*(8) < 14.8<5 1//3 , whence for 2 < k < n, we have 



3200n 3 



For each k, this is sharp up to a constant factor as shown by the following example: We may 
assume that k < n/6. Let G consist of k cliques of size ~ 2n/(3k) joined in a cycle by paths of 
length ~ n/(3k). For each i = 1, . . . , k, define fi to be the function that is 1 on the ith clique 
and goes to linearly on each of the paths leaving that clique, reaching at the midpoint. It 
is straightforward to calculate that Ray(/^) ~ 27/c 3 /n 3 . Since the supports of all fi are pairwise 
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separated, i.e., no vertex in the support of fi is adjacent to any vertex in the support of fj for i ^ j, 
the same asymptotic holds simultaneously for the Rayleigh quotient of every function in the linear 
span of the fi, whence < (27 + o(l))/c 3 /n 3 . 

We prove the above theorem by showing that Ray(F) = S7(/i(<5) 3 ). Our proof is a generalization 
of the proof of Proposition 5.8. Here, instead of just lower-bounding the Rayleigh quotient by 
considering a ball around a single vertex, we take f2(/c) disjoint balls about $l(k) vertices chosen 
carefully so that their spectral measure is within a constant factor of the average. This requires us 
to use the higher-dimensional embedding F, not merely its 1-dimensional relative /. 

Let k := [fi*(5)n/2\ + 1. We use Algorithm 1 to choose k disjoint balls based on the spectral 
embedding of G. 

Algorithm 1 Ball-Selection(a) 
Let S <- V. 
for % = 1 — > k do 

Choose a vertex Xi in Si-i that maximizes fi*.(5). 
Let Si^ S i ^ 1 \B F (x i ,a\\F(x l )\\). 
end for 

return B F (x 1 ,a\\F(x 1 )\\), . . . ,B F (x k ,a \\F(x k )\\ ). 



The next lemma shows properties of Ball-Selection that will be used in the proof. In the rest 
of the proof, we let a := 1/4. 

Lemma 6.2. The returned balls satisfy 

i) for each 1 < i < k, ^* x .{5) > /i*(<5)/3 and 

ii) for every 1 < i < j < k, 

b f (x u | HFO^Il) n b f (x jt | im^OH) = • 

Proof. First observe that by property (ii) of Lemma 4.12, for each 1 < i < k, we have 

MB F (* i ,a||f(a!i)||)( <y ) - I _ 4a 2 = 4 /3 • 

Since fJ-* S(j (5) = ^y{5) = nfi*(5), and by the above equation, the spectral measure of the removed 
vertices in each iteration of the for loop is at most 4/3, we obtain 

1**8^(8) > nv*(6) — (k — 1)4/3 > n»*(6)/3, 

where last inequality holds by the definition of k. Since x\ has the largest spectral measure in 
for 1 < i < k, we have 

This proves (i). Finally, (ii) follows simply by the fact that each center Xi is contained only in its 
own ball and none of the other k — 1 balls. □ 
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In the rest of the proof, let Bi := Bp(xi,a \\F(xi)\\ /2) for all 1 < i < k. In the next lemma, we 
prove strong lower bounds on the energy of every ball Bi. Then we shall bound the numerator of 
the Rayleigh quotient of F from below simply by adding up these lower bounds. 

Lemma 6.3. For every 1 < i < k, 



2001 Bi I 2 



Proof. We consider two cases. If w(xi) < \Bi\, then we lower-bound £(B{) by measuring the energy 
of the edges of a shortest path from Xi to the outside. Otherwise, we simply lower-bound £{Bi) by 
the stretch of edges of Xi to its neighbors outside of Bi . 

Since F is a centered embedding by Lemma 4.10, there is a vertex outside of each ball Bi by 
Lemma 4.8. Let Vi be the shortest path with respect to the graph distance in G from Xi to any 
vertex outside of Bi. Since G is connected, Vi is well defined. Using Lemma 4.9, we can lower-bound 
the energy of Bi by 

S(B)> ^^1 ^ {6) , £® (61) 

4|£i| 64 ■ w(xi) ■ \Bi\ 2DQ ■ w(xi) ■ \Bt\ ' { ' } 

where the equality holds by Lemma 4.11 and the second inequality holds by (i) of Lemma 6.2. By 
the above inequality, if w(xi) < \Bi\, then S(Bj) > 2 qq\b-\ 2 ' anc ^ we are done. 

On the other hand, suppose that w(xi) > \Bi\. Since G is a simple graph, at least w{xi) — \Bi\ + l 
of the neighbors of X{ in G are not contained in Bi. That is, | nbd(xj) \ Bi\ > w(xi) — \Bi\ + 1. We 
lower-bound the energy of Bi by the energy of the edges between Xi and its neighbors that are not 
contained in B{\ 

£(Bi)> \\F(x t )-F(y)\\ 2 >\nbd(x l )\B t \^\\F(x t )\\ 2 

>{w(x i )-\B i \ + l) ,>^r- 
V V %) ; 200-w(xi) ~ 200\Bi\ 

The second inequality uses the radius of the ball Bi, the third inequality follows from (6.1), and 
the last inequality follows by the lemma's assumption. □ 

Now we are ready to lower-bound Ray(-F). 

Proof of Theorem 6.1. By property (ii) of Lemma 6.2, the balls are disjoint. Therefore, X^i=i — 
n. Hence Lemma 4.13 yields 

5 > Ray(F) = ^ > — > £(Bi) 



^ 1 »*(S) > k 3 > MS) 
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2nfi*(6) ^ 200\Bi\ 2 ~ 400n 3 ~ 3200 ' 

where the second inequality follows by Lemma 4.11 and the fact that each edge is counted in at 
most two balls, the fourth inequality follows by convexity of the function s h-> 1/s 2 , and the last 
inequality holds by the fact that k > nfi*(S)/2. This completes the proof of Theorem 6.1. □ 
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As a corollary of the above theorem, we can upper-bound the average return probability of the 
lazy random walk (equivalently, the 7r-average squared L 2 -mixing time) on every finite connected 
graph. 

Corollary 6.4. For every unweighted, finite, connected graph G, and every integer t > 1, we have 

Y^xevPt(x,x) - 1 YJ_ 
n tV3 " 

Proof. By Lemma 4.6, we can write 

r2 148 f -2 



U x&V J ° 



e -Xt/2 x -2/3 dx 
1/3 rt 



< 4.94 f - ) / e~ s s- 2 / 3 ds < 17*- 1 / 3 



where the first inequality follows by Theorem 6.1. □ 

This bound is sharp up to a constant factor as shown by the example of a barbell graph. 

Our interest in this type of inequality is due to its application to counting the number t{G) 
of spanning trees of large finite graphs G. This relies on [Lyo05, Proposition 3.1], which says the 
following: 

Proposition 6.5. Suppose that G is a finite, unweighted, connected graph. Then 
logr(G) = - log (4|E|) + Y, lQ g M*) ~ Z) \ ( E x ) " l ) ■ 

x&V t>l x£V 

For the convenience of the reader, we have reproduced the proof in the appendix. 

As a consequence, we can estimate the number of spanning trees of simple graphs by knowing 
only local information. For a finite graph H with distinguished vertex o, let p r u{G) denote the 
proportion of vertices x of G such that there is an isomorphism from B^ist(x,r) to H that maps 
x to o. In [Lyo05], it is shown that the numbers Pt,h{G) determine the number r{G) of spanning 
trees of G by the infinite series above that converges at a rate determined by the average degree of 
G. In the case of simple graphs, [Lyo05] suggested that a result like Corollary 6.4 would be true, 
with the result that one has a uniform approximation to logr(G) for simple graphs: 

Corollary 6.6. Given r > 2, there is a function of the numbers p t ^h{G) and \V{G)\ for (simple 
connected) graphs G that gives | 1 logr(G) with an error less than 45/r 1 / 3 . In fact, there is such 
a function that depends only on the map (x,t) i— >■ (w(x),pt (x, x)) on V x [l,2r). 



Proof. Fix r > 2. Then 



log t(G)+ log (41^1) - ^log2u;(x)+ Yl t(Z^( x ' x ) 



x€V l<t<2r x&V 



|(E»M-i)<iviEa<i4 



t>2r x&V t>2r 
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Knowing the ball of radius r about x determines pt(x; G) for all t < 2r. Of course, the distribution 
of the degrees w(x) is determined by the neighborhoods of radius 1 and 2|£7(Cr)| = w(x). Thus, 
the desired function is 

/ 2r- 1 \ 

\V\~ l -log(4|£;|) +^log2^)- Y, t(E^O^)-i) • □ 
V xev t=i x&v J 

For the next corollary, we design a local algorithm that approximates the number of spanning 
trees of massive graphs. Our algorithm uses only an oracle satisfying the operations: select a 
uniformly random vertex of G, select a uniformly random neighbor of a given vertex x, return the 
degree of a given vertex x. The algorithm also uses knowledge of n and \E\. For any given e > 0, 
it approximates ^logr(G) within an e-additive error using only 0(poly(e _1 logn)) queries. 

Corollary 6.7. Let G be an unweighted, finite, connected graph. Given an oracle access to G 
that satisfies the above operations, together with knowledge of n and \E\, there is a randomized 
algorithm that for any given e,5 > 0, approximates logr(G)/|V| within an additive error of e, with 
probability at least 1 — 5, by using only 0(e~ 5 + e -2 log 2 n) log S^ 1 many oracle queries. 

Proof. Chooser := [90e~ 3 ] , so that 45r~ 1 / 3 < e/2. Write s := Ei<t< 2 r 1 / t Let w '■= k lo S 2w ( Q 
and Y := ^ ~ Etli 1 Pt(y , v) I '(«*) • Then by Corollary 6.6, 



rj^ ^m) +w _ sY+ s_ 



n n n 



< e/2. 



Therefore, we just have to approximate W — sY within an additive error of e/2. The details of the 
algorithm are described below. 

Algorithm 2 Approximate Spanning Trees (e) 

Let r ^— |~90e~ 3 ] and s ^— Y^i<t<2r V*- 

^— [641og(l/5)s 2 /e 2 ]. 
for i = 1 — > N do 

Let x be a randomly chosen vertex of G. 

Sample 1 < t < 2r with probability 1/st. 

Run a i-step lazy simple random walk from x, and let Yi <— I [Xt = x\. 
end for 

Sample [256 log (1/5) (log n) 2 /e 2 ] random vertices of G, and let W be the average of the logarithm 

of twice the degree of sampled vertices. 

return -n" 1 log(4| J E|) + W - s(Yi + ...+ Y N )/N + s/n. 

We start by describing how to approximate Y within an e/4s error (hence, to approximate sY 
within an e/4 error). We use a Monte Carlo sampling method. Let Xo, Xi, . . . , Xt represent a 
t-step lazy simple random walk started from a vertex of G. Then 

2r-l 



xev t=l 

= E E E ^-/[Xi = x 1 ,...,X t = x t \X = x]. 



nst 

xev i<t<2r xi,...,xtev 

x t =x 
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Consider a random walk starting at a random vertex and lasting a random length of time. Namely, 
let T> be the distribution on walks of lengths in [1, 2r) where 



[(x , Xl ,...,x t )] = —f[Xi 



x 1 ,...,X t = x t \X = x Q ] . 



Then Y = P© [{(xo, x\, . . . , xt) ■ xt = Xq}]. First we describe how to sample from V, then show how 
to approximate Y. First we sample a random vertex x of G, then we select a random 1 < t < 2r 
with probability 1/st (note that Yli<t<2r 1/st = 1 by definition of s). Finally, we choose a i-step 
random walk started from y and compute I [Xt = x\. See the details in Algorithm 2. 

We approximate Y by sampling N := [~641og(l/<5)s 2 /e 2 ] independent elements of T> and com- 
puting their average. Let Yi := I [Xt = x] be the ith sample of V. By definition, Yi G [0, 1] and 
E[Yj] = Y. Since Y\,... ,Yn are independent, Hoeffding's inequality gives 



Yx + ... + Y N 



N 



Y 



> 



4s 



< 2exp 



e 2 N 
16s 2 



< V 2 ' 



Therefore, with probability at least 1 — (5/2, we have that s{Y\ + • • • + Y^)/N approximates sY 
within an error of e/4. It remains to approximate W within error e/4 and with probability at least 
1 — (5/2. That can be done easily by sampling 0(e _2 log(5 _1 log 2 n) independent uniform random 
vertices of G and taking the average of the logarithm of twice their degrees, W (see step 8 of 
Algorithm 2). Since \og2w{y) < 21ogra for all y £ V, again by Hoeffding's inequality we have 



\W-W\ 



e 

> - 
- 4 



< 5/2. 



Therefore, by the union bound the algorithm succeeds with probability at least 1 — 5. 

It remains to compute the number of oracle accesses. We used 0(e _2 log<5 _1 log 2 n) accesses to 
approximate W. On the other hand, we can compute each Y{ with at most 2r = 0(e~ 3 ) oracle 
accesses. Therefore, we can approximate Y with at most 



2Nr = 0(e" 5 log<5- 1 s 2 ) 



0(e- 5 log z e~Mog(5 



2 ,-1 



CM 



-5 



log 5 



many queries. 



□ 



We note that knowing \E\ is not really necessary for this algorithm, since it contributes a term 
of size 0(n _1 log n), which will be much less than e in any reasonable example where one might 
use this algorithm. 



7 Bounds for Vertex- Transitive Graphs 

Let G be a weighted, locally finite, vertex-transitive graph. We recall that G is vertex-transitive 
if for every two vertices x,y G V, there is an automorphism eft: G —> G such that 4>(x) = y. Since 
G is transitive, it is a u>-regular graph, where w = w(x) for every x G V. Therefore, C = ^L. 

For a vertex x G G and r > 0, let N(x, r) := \B ( ^ st (x, r)|. Since G is vertex transitive, iV(a;, r) = 
N(y, r) for every two vertices x, y G V. Therefore, we may drop the index x and use N(r). 

The following theorem is the main result of this section. The first part appears to be new. 
The last part is known and the factor of 2 there is not actually needed: this is proved in [DSC93, 
Corollary 1] for Cayley graphs, and one can apply the congestion method, e.g., [LPW06, Corollary 
13.24], for general transitive graphs. 
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Theorem 7.1. For every connected, weighted, vertex-transitive, locally finite graph G, every a G 
(0, 1), 5 £ (0, 2), and every x £ V, if w(x, y) > 1 for all adjacent pairs of vertices, then 



H*J8) = fj*(S) < \ ==- . (7.1) 



In addition, if G is finite, then 



2w diam 2 ^ ^ 



First we show that the spectral projections are equivariant with respect to the automorphisms 
of the graph G. Here, we identify an automorphism (j) of G with the unitary operator / i— )■ cf)f 
that it induces, where (cj)f)(x) := f(<p~ 1 (x)) for / G £ 2 (V) and x G V. Consequently, the spectral 
measures of vertices are the same for all vertices. 

Fact 7.2. Every automorphism operator (j) commutes with the Laplacian, i.e., 4>C = Ccp. Since 
1(8) is a function of C, every automorphism eft also commutes with 1(5) and I* (5). 

Proof. The first fact is clear from the definition of the Laplacian. The second follows from the 
symbolic calculus. □ 

Lemma 7.3. For every two vertices x,y G V and every 5 > 0, we have fi x (5) = n y (8), whence 
»* x (6)=fi* y (6). 

Proof. Choose an automorphism <f> such that (j)l x = l y . Since (j) commutes with I* (5), we have 
I* (5) = (/T 1 /* (5)<t>. Therefore, 

f, x (S) = (I(5)l x , l x ) = (I(8)(t>l y: <t>l y ) = (4>- l I{6)^y, ly) = (I(S)ly, ly) = Hy(S) , 

where the third equation follows by the fact that is a unitary operator. Since G is regular, we 
also get n%(5) = M*(<5). □ 

The next two lemmas show particular properties of the spectral embedding of vertex-transitive 
graphs. The first part was observed for finite graphs in [DT97, Proposition 1]. 

Lemma 7.4. For every two vertices x,y G V, we have ||jP(x)|| = ||-F(y)||. Furthermore, for every 
automorphism (j), 

\\F(x) - F(y)\\ = \\F(ct>(x))-F(ct>(y))\\ . 

Proof. Choose an automorphism such that <f>l x = l y . Since (f>I*(8) = I*(S)(j> by Fact 7.2, we 
have 

F(x) = -^=I*(5)1 X = -^=r(S)<t>l y = -$=<f>r(8)ly = 4>F(y) . 



W \ W \/W 



Since <p is a unitary operator, it preserves the norm, thus ||F(x)|| = ||F(y)||. We also proved that 
F((j)(y)) =4>(F(y)). Therefore 

\\F{cj>(x)) - F(<P(y)) || = \\(f>(F(x) - F(y)) \\ = \\F(x) - F(y)\\ . □ 
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Lemma 7.5. For every weighted vertex-transitive graph G, every vertex x G V , and 5 G (0,2), 

£>(x 5 y) - F(y)|| 2 = 2(F(x),LF(x)) . (7.3) 

Hence, if w(x,y) > 1 /or a// y ~ x, £/ien 

> max rJ; ||-F(rc) - F'~-' A]l 



nF(x)\\i 

Proof. By Lemma 7.4, we have 

^2w(x,y) \\F(x) - F(y)\\ 2 = 2w \\F(x)\\ 2 -2^w(x,y){F(x),F(y)) 

y~x y~x 



2/F(x),^ W (x,y)(F(x)-F(y))\. 
\ I 



On the other hand, 

w(x, y) (F(x) - F(y)) = -±= J] «,(x, y)(/* (5)l x - I*(5)l y ) 

= ^jS ^ U7(a?, 1/5(1* - l tf ) = r (5)Le x = LI*(5)e x = LF(x) . 

V y~x 

Putting together the above equations, we obtain (7.3). Therefore, 

max||F(x) - F{y)\\ 2 < 2(F(x), LF(x)) < 2S(F(x), F(x)) 2 w , 

where the first inequality holds by the assumption that w(x, y) > 1, and the second inequality holds 
by Lemma 4.5 and the fact that F(x) = I*(5)e x G img . □ 

Proof of Theorem 7.1. For a vertex x G V, let /3(x) := max^a, H-^Xx) — F{y)\\ 2 ■ Since G is vertex 
transitive, Lemma 7.4 tells us that (3(x) = (3{y) for every two vertices x,y £ V. Therefore, we may 
drop the x and use (3. By Lemma 7.5 and Lemma 4.11, 



maxy,^ \\F{x) - F{y)\\ 2 (5 (3 

2\\F(x)\\l ~ 2w\F(x)(x)\* 2»*(6)' 



w 

2/ 



Similarly to the proof of Proposition 5.8, for a vertex x £ V, define / G £ (V) by 

Observe that by Lemma 4.11, /(x) = \J fj,*(5)/w. Furthermore, since F(x) G img (/*(<*)), we 
have \\f\\ w = 1 by Lemma 4.12. For a G (0,1), let B := B f (x,af(x)) . Since ||/|| w = 1 and, for 
every y G B, we have /(y) > (1 — a)f(x), it follows that 

\B\ < 1 1 



w(l - a) 2 /(x) 2 (l-a)VO*)' 
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Also, let 



Let B' := B,n st (x,r). We shall show that B' C B. Since by (7.4), r > a/y/2w5, we get 

N ( J < iV(r) = < |B| < 1 



V2^y - w ' 1 - 1 1 - (l-a)VO*) ' 

The first part of the theorem follows by the above inequality 

It remains to show that B' C B. First observe that for every adjacent pair of vertices y,z G V, 
we have 

F(x) 



^>\\F{y)-F{z)\\ > 



F(y)-F(z), 



\m-m\ 



\F(x) 

Since there is a path of length at most r from x to every vertex in B', we have 



Vy^B' \f(y)- f(x)\< ry /f3 = af(x). 

Thus every y £ B' also belongs to B. This proves the main part of Theorem 7.1. 

To prove the bound on A2 when G is finite, let R be the shortest distance from x to a vertex 
y where f(y) < 0. Such a vertex y exists because F is centered by Lemma 4.10. Then as above, 
R^fi — f{ x )i i- e -> — wfiR 2 . On the other hand, from (7.4) we have /3 < 2<5/i*(<5). Combining 

these two inequalities yields /3 < 2w(35R 2 , i.e., 5 > l/(2wi? 2 ) > l/(2ii; diam 2 ). All this is predicated 
on the assumption that F(5) ^ 0, so taking 5 := A2 gives the result. □ 

Theorem 7.1 is strong enough to give known sharp results. We give two examples, with proofs 
relegated to the appendix. 

In the first corollary, we give an upper bound on the return probability of the random walks for 
graphs with polynomial (or faster) growth rate. 

Corollary 7.6. Let G be a finite or infinite, unweighted, d-regular, vertex-transitive graph with at 
least polynomial growth rate N(r) > Cr a , where C,a are constants and < r < diam. Then for 
every x G V, every 5 G (0, 2), and every t > 0, 

M *(£) < C'5 a ' 2 , 

where 

(a + 2) Q+2 (2d) a / 2 
: 4Ca« 

and 

Pt(x,x) -tt(s) < C"t- a l 2 , 

where 

_ (a + 2T^(4dr^ /a 
c SC^ 31 I2 

Without an explicit constant, such a result concerning return probabilities was first proved for 
infinite Cayley graphs in the celebrated breakthroughs of Varopoulos; see [CGP01, Corollary 7.3]. 
The case of finite Cayley graphs was done in a similar result of Diaconis and Saloff-Coste [DSC94, 
Theorem 2.3]. For results on uniform mixing time, see [GMT06]. 
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For example, the usual Cayley graph of Z a has n*(5) = |{(si, . . . , s a ) : 1 — Ylt=i cos(2-7TSj)/a < 
8}\. Since 1 - YT i=1 cos(27rs;)/a < 4vr 2 £\ s i/ a and the unit bal1 in Ra has volume 2W 2 /T(a/2), 
it follows that /4(S) > 2(a/(4Tr)) a/2 F(a/2)- 1 5 a / 2 (and is asymptotic to this as S -> 0). 

The next corollary gives comparable results for super-polynomial growth. 

Corollary 7.7. For every finite or infinite, unweighted, d-regular, vertex-transitive graph G with 
super-polynomial growth rate N(r) > c\ exp(c2r a ) for some < a < 1 and positive constants c\,c-i 
(0 < r < diam ), and every x G V , 

^ x (5)<-eM-c 3 S- a/2 }, 
ci 

where 

c 3 := c 2 (8d)- a /\ 

and 

Pt(x,x) - ir(x) < — (l + c 4 t^+2) exp{-c 4 £^T2} , 

Cl 

where 

Without an explicit constant, such a result concerning return probabilities was first proved for 
infinite Cayley graphs by Varopoulos; see [CGP01, Corollary 7.4]. The case of finite Cayley graphs 
was done in a similar result of [DSC94, Theorem 2.7]. 

The bound on the spectral measure is also proved (without explicit constants) by [BPS12, 
Corollary 1.8] for infinite amenable Cayley graphs. We remark that a better bound on the return 
probabilities can be obtained by choosing a in the proof closer to 1 or even as a function of A. 

For additional information on the spectrum of infinite groups, see [BPS12, BBP11]. 
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A Appendix: Miscellaneous Proofs 



The first proposition is based on [MP05, (16)~(19)]. 
Proposition A.l. For every finite graph G and e > 0, we have 

f \ f i w f~ \r P2t(x,x) 2 
T2{£) = mm < t : Vx G K — — < 1 + e 



7t(j;) 

Proof. For every t > and vertex x £ V, we have 



E 



Pt(x,y) 



Pt(x,y) 
n{y) 



Pt{x,y)p t {y,x) P2t(x,x) 



n(x) 



ir(x) 



where in the first equality we used the fact that pt(x, •) and ir(-) are probability distributions, and 
the second equality follows by the fact that pt{x,y)-K{x) = p t (y,x)i:{y). 
In addition, stationarity of ir gives 



P2t(x,y) - vr(y) 



!>(*) 

z 



7r(*) 
7T(z) 



Pt(z,y) 
7r(y) 

Pt(y,z) 
tt(z) 



7T(z) 



7r(z) 



I P2t{X,X) 

ir(x) 



1, 



Ip2t(y,y) 



H"(y) 



1 . 



Since t h-> pt(x,x) is monotone decreasing, the result follows. 
Proof of Lemma 4- 7. We have 



Pt(x,x) - ir(x) 



(l-\/2) t di4(\)>(l-6/2) t t4(S), 



whence 



M:w<(i-v2r*(M^^)-^))- 

Since log(l — s) > — s/(l — s) for < s < 1, it follows that 

(1 _ 5/2)-* < (l _ 5/2)- 1 exp ( ( \~^ 2 /2 ) < (1 - V2)- X e < 2e 

by choosing t := |_2 / <5J . This proves the first inequality. 
The second inequality is a little simpler: 

q t (x,x)-n(x)- - 

Substitution of i := 1/5 gives the result. 



e-^/4(A) > e- dt /4(*) 



□ 



□ 
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Proof of Corollary 5.11. Define 

W{X) x 



r : = 



c(l-a)2 M *(5) 



Since = wt(x,0) > c, we have ro > 1. Take r := [Yo] + 1. Then the hypothesis wt(x,r) > 

(i-o) V* ((5) °f P ro P os iti° n 5.8 is satisfied and r < 3ro/2. Substitution of this bound in Proposition 5.8 
with the choice a := a/ {a + 1) gives the claimed upper bound on pi%(8)- Now use Lemma 4.6 to 
get that 

,■2 

Pt(x,x)< / e- Xt / 2 Cw(x)-^—\- 1 ^ a+1 U\ 
Jo a + l 

< Cw(x)a ( 2 \ a/(a+1) ^ e - ag -i/(«+i) ds = C > w{x)t -a/{a + i) n 



a + l W 







Proof of Proposition 6.5. Write det' A for the product of the non-zero eigenvalues of a matrix A. 
As shown by [RS74], we may rewrite the Matrix- Tree Theorem as 

[the proof follows from looking at the coefficient of s in det (I — P — si) = (det 2D)~ 1 det(L — 2sD) 
and using the Matrix- Tree Theorem in its original form with cofactors]. Thus, 

logT(G) = -log(4|#|) + ^log2u;(j;) +logdet'(/-P). (A.l) 
Let Afc be the eigenvalues of P with Ai = 1. We may rewrite the last term of (A.l) as 

n n 

log det'(/ -p) = E - ^) = - E E 

fc=2 fc=2 t>l 

= -EE^ = -E7( trp4 - 1 )- 



t>l k=2 t>l 



Since trP* = Ylxev Pt(%, %), the desired formula now follows from this and (A.l). □ 
Proof of Corollary 7.6. First, note that for aj\Jlwb < diam, we have 



(l-a) 2 N(a/V2wS) C(l-a)'- 



a" 



by (7.1). In particular, this holds for 5 > l/(2w diam 2 ), whence for 5 > A2 in the finite case. Since 
H*(6) = for 5 < A2, it follows that the bound 



{2w5) a / 2 
C(l - a) 2 a° 



£ +F+^ (A.2) 
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applies for all 5 > even when G is finite. 

Now, set a := a/(a+2). The first inequality fx%{5) < C'5 a ^ 2 is immediate from (A. 2). Therefore, 
Lemma 4.6 allows us to write 



(a + 2)«+ 2 C e -Xtl2yal2-X dX 



a/2 r t 



8Ca a 


-l 


(a + 2) 


a+2 


8Ca a 


-1 


(a + 2) 


a+2 


8Ca a 


-1 







a/2 

• * • 1 '•■;)( /' • 

Proof of Corollary 7.7. As in the proof of Corollary 7.6, we may ignore the restriction on r when 
substituting the growth condition into (7.1). 

The bound on fJ,%{5) is immediate from Theorem 7.1. 

a 

Define f3(t) := C4t a + 2 . By Lemma 4.6 and Theorem 7.1, we can write 

Jo \c 1 exp{c 2 {8dX)-'ll)) C1 7„ 

where we used a = 1/2. Since A i— V —At/2 — C2(8<iA) _a//2 is a concave function, it is maximized at 

2 

A* := f^M a+2 (8d)-^T2. Therefore 

-y - c 2 (8dA)- a / 2 < - C2 (8dA,)- a / 2 = -p{t) . 



Therefore, 



Pt(x,x)< [ K ( 4 — ) d\ + -2ac 2 (8d)- a / 2 e-^ f°° dX 

Jo Vciexp(c 2 (8(iA) a l 1 )) ci 7 A>fc 



38 



